Fast accurate fuzzy clustering through data reduction

نویسندگان

  • Steven Eschrich
  • Jingwei Ke
  • Lawrence O. Hall
  • Dmitry B. Goldgof
چکیده

Clustering is a useful approach in image segmentation, data mining and other pattern recognition problems for which unlabeled data exist. Fuzzy clustering using fuzzy c-means or variants of it can provide a data partition that is both better and more meaningful than hard clustering approaches. The clustering process can be quite slow when there are many objects or patterns to be clustered. This paper discusses an algorithm, brFCM, which is able to reduce the number of distinct patterns which must be clustered without adversely affecting partition quality. The reduction is done by aggregating similar examples and then using a weighted exemplar in the clustering process. The reduction in the amount of clustering data allows a partition of the data to be produced faster. The algorithm is applied to the problem of segmenting 32 magnetic resonance images into different tissue types and the problem of segmenting 172 infrared images into trees, grass and target. Average speed-ups of as much as 59 to 290 times a traditional implementation of fuzzy c-means were obtained using brFCM, while producing partitions that are equivalent to those produced by fuzzy c-means.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

Data-Driven Fuzzy Modeling: Transparency and Complexity Issues

Recently, the interest in data-driven approaches to the modeling of nonlinear processes has increased. Techniques based on fuzzy sets and rule-based systems have proven suitable mainly because of their potential to yield transparent models that are at the same time reasonably accurate. Many of the data-driven fuzzy modeling algorithms, however, aim primarily at good numerical approximation, whi...

متن کامل

Fast Fuzzy Clustering of Infrared Images

Clustering is an important technique for unsupervised image segmentation. The use of fuzzy c-means clustering can provide more information and better partitions than traditional c-means. In image processing, the ability to reduce the precision of the input data and aggregate similar examples can lead to significant data reduction and correspondingly less execution time. This paper discusses brF...

متن کامل

Evaluation of the nutritional effects of fasting on cardiovascular diseases, using fuzzy data mining

Background: Advances in information technology and data collection methods have enabled high-speed collection and storage of huge amounts of data. Data mining can be used to derive laws from large data volumes and their characteristics. Similarly, fuzzy logic by facilitating the understanding of events is considered a suitable complement to scientific data mining. Materials and Methods: The pre...

متن کامل

Dna Algorithm and Fuzzy Evolutionary Clustering for Image Reconstruction

DNA algorithm and fuzzy evolutionary clustering techniques are used to classify damaged images and to reconstruct the original images. Experimental results show both methods are far more effective than the use of genetic algorithms or c-means clustering. Particularly, the method of fuzzy evolutionary clustering provides very fast convergence and accurate image reconstruction with absolute certa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Fuzzy Systems

دوره 11  شماره 

صفحات  -

تاریخ انتشار 2003